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Abstract 

We achieve a quantum speed-up of fully polynomial randomized approximation 
r~| . schemes (FPRAS) for estimating partition functions that combine simulated annealing 

Q-f with the Monte-Carlo Markov Chain method and use non-adaptive cooling schedules. 

The improvement in time complexity is twofold: a quadratic reduction with respect 
j3 . to the spectral gap of the underlying Markov chains and a quadratic reduction with 

respect to the parameter characterizing the desired accuracy of the estimate output by 

D" 1 



the FPRAS. Both reductions are intimately related and cannot be achieved separately. 

First, we use Grover's fixed point search, quantum walks and phase estimation to 
efficiently prepare approximate coherent encodings of stationary distributions of the 
Markov chains. The speed-up we obtain in this way is due to the quadratic relation 
between the spectral and phase gaps of classical and quantum walks. Second, we 
generalize the method of quantum counting, showing how to estimate expected values 
of quantum observables. Using this method instead of classical sampling, we obtain 
the speed-up with respect to accuracy. 
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1 Introduction 



Quantization of classical Markov chains has been crucial in the design of efficient quantum 
algorithms for a wide range of search problems that outperform their classical counterparts. 
We refer the reader to the survey article [1] for a detailed account of the rapidly growing 
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collection of quantum-walk-based search algorithms. In this context, we also point to the 
work |2J, where the authors apply quantized Markov chains to speed up search algorithms 
based on simulated annealing for finding low-energy states of (classical) Hamiltonians. 

In this paper, we extend the scope of use of quantized Markov chains beyond search 
problems. We show how to employ them to speed up fully polynomial-time randomized 
approximation schemes for partition functions, based on simulated annealing and the Monte 
Carlo Markov Chain (MCMC) method. To achieve this improvement, we rely on Szegedy's 
general method to quantize classical Markov chains [Sill], which we review in Appendix IA1 
This method gives us a unitary quantum walk operator W(P) corresponding to one update 
step of the classical Markov chain P. The complexity of the classical algorithms we are 
speeding up is measured in the number of Markov chain invocations. Similarly, we express 
the complexity of our quantum algorithm as the number of times we have to apply a quantum 
walk operator. As shown in [S], this operator can be implemented precisely and efficiently. 

Sampling from stationary distributions of Markov chains combined with simulated an- 
nealing is at the heart of many clever classical approximation algorithms. Notable examples 
include the algorithm for approximating the volume of convex bodies (6] , the permanent of 
a non-negative matrix [7J, and the partition function of statistical physics models such as 
the Ising model [S] and the Potts model [9]. Each of these algorithms is a fully polynomial 
randomized approximation scheme (FPRAS), outputting a random number Z within a factor 
of (1 ± e) of the real value Z, with probability greater than |, i.e. 



in a number of steps polynomial in 1/e and the problem size. 

We show how to use a quantum computer to speed up a class of FPRAS for estimating 
partition functions that rely on simulated annealing and the Monte Carlo Markov Chain 
method (e.g. [HIS])- Let us start with an outline of these classical algorithms. Consider a 
physical system with state space Q and an energy function E : Q — > R, assigning each state 
(jGOan energy E(o~). The task is to estimate the Gibbs partition function 



at a desired (usually low) temperature Tp. The partition function Z(T) encodes the thermo- 
dynamical properties of the system in equilibrium at temperature T, where the probability 
of finding the system in state a is given by the Boltzmann distribution 



It is hard to estimate Z(T) directly. The schemes we want to speed up use the following 
trick. Consider a sequence of decreasing temperatures T > 7\ > • • • > T(, where T is a 
very high starting temperature and Tg = T F is the desired final temperature. Then, Z{T F ) 




(1) 




(2) 
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can be expressed telescoping product 



Z(T F )=Z ^... 




Z (a ai ■ ■ ■ ai- 2 a e -i) , 



(4) 



Q 



where Zj = Z(Tj) stands for the Gibbs partition function at temperature Tj and a, = Z i+ i/Zi. 
It is easy to calculate the partition function Z = Z(T ) at high temperature. Next, for each 
i, we can estimate the ratio ai by sampling from a distribution that is sufficiently close to 
the Boltzmann distribution 7Tj ([3]) at temperature Tj (see Section [2] for more detail). This is 
possible by using a rapidly-mixing Markov chain Pi whose stationary distribution is equal 
to the Boltzmann distribution 7Tj. 

To be efficient, these classical schemes require that 

1. we use a cooling schedule such that the resulting ratios ctj = Z (T i+ i) / Z (Ti) are lower 
bounded by a constant c _1 (to simplify the presentation, we use c = 2 from now on), 

2. the spectral gaps of the Markov chains Pi are bounded from below by 5. 

The time complexity of such FPRAS, i.e., the number of times we have to invoke an update 
step for a Markov chain from {Pi, ... , Pe-i}, is 



where O means up to logarithmic factors. 

Our main result is a general method for 'quantizing' such algorithms. 

Theorem 1. Consider a classical FPRAS for approximating the Gibbs partition function of 
a physical system at temperature Tp, satisfying the above conditions. Then, there exists a 
fully polynomial quantum approximation scheme that uses 



applications of a controlled version of a quantum walk operator from {W(Pi), . . . , W(Pe_i)} . 

The reduction in complexity for our quantum algorithm (in comparison to the classical 
FPRAS) is twofold. First, we reduce the factor 1/5 to \j\fb by using quantum walks instead 
of classical Markov chains, and utilizing the quadratic relation between spectral and phase 
gaps. As observed in [I], this relation is at the heart of many quantum search algorithms 
based on quantum walks (see e.g. p] for an overview of such quantum algorithms). Second, 
we speed up the way to determine the ratios ai by using the quantum phase estimation 
algorithm in a novel way. This results in the reduction of the factor 1/e 2 to 1/e. 

The quantum algorithm we present builds upon our previous work |10| . where two of us 
have shown how to use quantum walks to approximately prepare coherent encodings 




(5) 




(6) 




(7) 
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of stationary distributions 7Tj of Markov chains Pi, provided that the Markov chains are 
slowly- varying. Recall that a sequence of Markov chains is called slowly-varying if the sta- 
tionary distributions of two adjacent chains are sufficiently close to each other. As we will 
see later, this condition is automatically satisfied for Markov chains that are used in FPRAS 
for approximating partition functions. 

Note that our objective of approximately preparing coherent encodings of stationary 
distributions is different from the objective in [H], where the author seeks to speed up the 
process of approximately preparing density operators encoding stationary distributions. For 
our purposes, we have to work with coherent encodings because otherwise we could not 
achieve the second reduction from 1/e 2 to \je. 

The paper is organized as follows. In Section [2] we review the classical FPRAS in more 
detail. We present our quantum algorithm in two steps. First, in Section 13.21 we explain 
how our quantum algorithm works, assuming that we can perfectly and efficiently prepare 
coherent encodings of the distributions ([3]). Then, in Section ETBI we describe the full quantum 
algorithm, dropping the assumption of Section 13.21 and using approximate procedures for 
quantum sample preparation and readout, which are based on the quantum walks. We 
perform a detailed analysis of accumulation of error due to the approximation procedures 
and show that the success probability remains high, establishing Theorem [TJ Finally, in 
Section @] we conclude with a discussion of open questions, the connection of our algorithm 
to simulated annealing, and the directions for future research. 



2 Structure of the Classical Algorithm 

Here we describe the classical approximation schemes in more detail, following closely the 
presentation in Section 2.1]. Choosing a sequence of temperatures T > T\ > ■ ■ ■ > % 
starting with T = oo, and ending with the desired final (low) temperature Ti = T F , we 
can express the Gibbs partition function ([2]) as a telescoping product (jlj). At T = oo, the 
partition function Z is equal to 

z = \n\, (8) 

the size of the state space. On the other hand, for each % = 0, ...,£— 1, we can estimate the 
ratio 

m = — (9) 

in (jl]) as follows. Let X{ ~ 7Tj denote a random state chosen according to the Boltzmann 
distribution 7Tj, i.e., 

Pr(X i = a) = m(<7) . (10) 
Define a new random variable by 

y. = e -(A + i-/K)iW (11) 
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where $ = (kTj) 1 is the inverse temperature (k is the Boltzmann constant). This Y> is an 
unbiased estimator for ai since 

E(Yj) = ^7r,(a)e-( ft + 1 - /31 )^ (12) 
o-efi 

- =-t= a - ■ (14) 

Assume now that we have an algorithm for generating states Xi according to 7Tj. We draw 

m := 64£/e 2 (15) 

samples of Xi and take the mean Y \ of their corresponding estimators Y^. Then, the mean 
Y i satisfies 

Var (FQ _ e 2 Var (Y t ) e 2 

(e (Fj) ) 2 64£( E (F l )) 2 - ier 1 J 

(We have used the assumption | < ctj < 1.) We can now compose such estimates of ctj. 
Define a new random variable Y by 

F = F £ _ 1 F,_ 2 ---F (17) 

Since all Y \ are independent, we have 

E (F) = E (y^O E (y_ 2 ) • ■ ■ E (y ) = o^o^ • • • a = a, 
Moreover, Y has the property 

Var(F) _ E(Ft 1 )---E(F 2 )-E(F,_ 1 ) 2 ---E(F ) 2 



( E ^)) 2 e(fL) 2 ...e(F ) 2 

i Var gVj \ / Var (F ) \ i 
(E(F,_0)V V (E(F )) n 



< fe e2 / 16 ^-l 

,2 



< e 2 /8, 

where we used 1 + x < e x (true for all x) and — 1 < 2x (true for all x G [0, 1]) in the 
last two steps, respectively. Chebyshev's inequality now implies that the value of Y is in the 
interval [(1 — e)a, (1 + e)a\ with probability at least |. 



Of course, we are not able to obtain perfect samples Xi from 7^. Assume now that we 
have X[ that are from a distribution with a variation distance from 7T; smaller than 



d := e7(512£ 2 ). (18) 

Let Y be defined as Y as above, but instead of Xi we use X[. Then, with probability at 
least |, we have Y = Y . To derive this, observe that the algorithm can be thought to first 
take a sample from a product probability distribution tc on the (mi)-fold direct product of 
Q. We denote the probability distribution in the case of imperfect samples by it'. The total 
variation distance between ir and it' is then bounded from above by 



e 2 6A£ i 



Therefore, Y is in the interval [(1 — e)E (Y) , (1 + e)E (Y)] with probability at least |. 

We obtain the samples X- by applying Markov chains Pj whose limiting distributions are 
equal to 71$. Constructing such rapidly- mixing Markov chains is a hard task, but it has been 
done for the Ising model [8] and the Potts model [9]. 



3 Quantum Algorithm 

3.1 Overview 

The classical FPRAS we described in Section [2] consists of 

1. preparing many samples from a distribution close to 7Tj by letting a suitable Markov 
chain mix, 

2. using these samples to approximate the ratios a, in (TjJ, and 

3. composing these estimates of a« into an estimate of the partition function. 

We build our quantum algorithm on this scheme, with two novel quantum ingredients. First, 
instead of letting a Markov chain Pi mix towards its stationary distribution 7Tj, we choose to 
approximately prepare the state \^i) = Yl a V 7r «( cr ) I "); a coherent encoding of the Boltzmann 
distribution. We use a preparation method [10] based on Grover's |-fixed-point search 
[13], efficiently driving the state | vr ) towards the desired state through a sequence of 
intermediate states. 

Second, instead of using classical samples from the distribution 7Tj, we approximate «j 
by phase-estimation of a certain unitary on the state \^i)- This is a new concept, going 
beyond our previous work [10J. This phase-estimation subroutine can be efficiently (albeit 
only approximately) applied by utilizing quantum walks. 

The structure of our algorithm is depicted in Fig. [TJ It consists of successive approximate 
preparations of | tt^) followed by a quantum circuit outputting a good approximation to 
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7T ) — M.q (obtain a ) 



7r ) — > |7ri) — A^i (obtain ai 



ko) -> K) 



F*-i 



) — -M^_i (obtain a^_i) 



Figure 1: Structure of the quantum algorithm. 



(with high probability). Our main result is the construction of a fast quantum version of a 
class of classical algorithms, summed in Theorem [U 
We arrive at our quantum algorithm in two steps. 



First, in Section [3721 we explain how 
to quantize the the classical algorithm in the perfect case, assuming that we can take perfect 
samples Xi from 7Tj. Then, in Section [3T31 we release this assumption and describe the full 
quantum algorithm. 



3.2 Perfect Case 

To estimate the ratios «j in (j3J), the classical algorithm generates random states Xi from 7Tj 
and computes the mean Y ^ of the random variables Yj. The process of generating a random 
state Xi from 7Tj is equivalent to preparing the mixed state 

Pl = J2M°)W)W\- (20) 

Instead of this, we choose to prepare the pure states 

I^H^Tv^Rk)- (2i) 

We call these states quantum samples since they coherently encode the probability distribu- 
tions 7Tj. In this Section, we assume that we can prepare these exactly and efficiently 

The random variable Y{ can be interpreted as the outcome of the measurement of the 
observable 

A i = Y,Vi{a)W){a\ (22) 

in the state p i: where 

yi (<j) = g-Ofc+i-AW . (23) 

With this interpretation in mind, we see that to estimate oti classically, we need to esti- 
mate the expected value Tr(^4jpj) by repeating the above measurement several times and 
outputting the mean of the outcomes. 
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We now explain how to quantize this process. We add an ancilla qubit to our quantum 
system in which the quantum samples |7Tj) live. For each % = 0, ...,£ — 1, we define the 
unitary 

This Vi can be efficiently implemented, it is a rotation on the extra qubit controlled by the 
state of the first tensor component. Let us label 

|^) = ^(ki>®|0». (25) 

Consider now the expected value of the projector 

P = I®|0)<0| (26) 

in the state \ipi). We find 

(il>i\P\ip i ) = (n\Mn) = <x i . (27) 

We now show how to speed up the process of estimating a» with a method that generalizes 
quantum counting [11] . As noted in the beginning of this Section, we assume efficient 
preparation of |7Tj), which in turn implies that we can efficiently implement the reflections 

R i = 2\ir i )(ir i \-1. (28) 

The result of this Section, the existence of a quantum FPRAS for estimating the partition 
function assuming efficient and perfect preparation of |7Tj), is summed in Theorem [2j 

Theorem 2. There is a fully polynomial quantum approximation scheme A for the partition 
function Z . Its output Q satisfies 

Pr [(1 -e)Z < Q < (l + e)Z] > -. (29) 
For each i — 0, ...,£ — 1, the scheme A uses 

o{io g e) (30) 

perfectly prepared quantum samples , and applies the controlled-Ri operator 

log*) (31) 

times, where Ri is as in (2R) . 

To prove Theorem [21 we need the following three technical results. 
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Lemma 1 (Quantum ratio estimation). Let e pe £ (0, 1). For each i = 0, ...,£ — 1 there 
exists a quantum approximation scheme A\ for a*. Its output Q\ satisfies 

Pr [(1 - e pe ) ai < Q\ < (1 + e pe )oJ > -. (32) 

o 

The scheme A\ requires one copy of the quantum sample | tt^) and invokes the controlled-Ri 
operator O (t p e) times, where Ri is as in (E2J). 

Proof. Let 

G=(2|^)(^|-I)(2P-I). (33) 



Define the basis states 



M J1^M, and M = m. (34) 

V 1 - Oj A/Oi 



, , cos# sintf . , 



Restricted to the plane spanned by |7i) and I72), G acts as a rotation 

I / cos# sin# 

I{|7i>,l72>} ~~ \ — sin# cos 8 

where 9 £ [0, ~] satisfies 

008 = 204-1. (36) 
The eigenvectors and eigenvalues of G are 



1 

±i 



A± = e ±l& . (37) 



We do not have direct access to one of these eigenvectors, as the state is in a superposition 
of \G + ) and \GJ). Thus, when we apply the phase estimation circuit for the unitary G to the 
state we will sometimes obtain an estimate of 9, and sometimes an estimate of 2tt — 9. 
However, this is not a problem since both 9 and 2ir — 9 plugged into fl36|) yield the same 
result for Oj. 

We require that the estimate 9' satisfies 

\9'-9\ < 2e pe ai < e pe (38) 

with probability at least |. Using the phase estimation circuit in [12], this means that |- 
has to be an n a = log bit approximation of the phase and the failure probability p / has 

to be less than |. To achieve this, it suffices to use a phase estimation circuit (see Fig. [2]) 
with 

t = log ^ + log f 2 + -L)=0 (loge;, 1 ) 
ancilla qubits. This circuit invokes the controlled-G operation 0(2*) = O (e p e) times. 
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|o> -ThI 



|o) 



_H_ 

IT 



DFT f - 



O 1 



G 21 



G 2 



Figure 2: A basic phase estimation circuit with t ancilla qubits 



Let ct i denote the value we compute from the estimate 6'. We have 



|cKj — a[\ — - \cos9 — cos6>'| < - 



(39) 



showing that the estimate ol i is within ±e pe aj of the exact value cti with probability at least |. 
This completes the proof that the random variable Q\ corresponding to the output satisfies 
the desired properties on estimation accuracy and success probability. □ 

We can boost the success probability of the above quantum approximation scheme for 
the ratio by applying the powering lemma from [16J, which we state here for completeness: 

Lemma 2 (Powering lemma for approximation schemes). Let B' be a (classical or quantum) 
approximation scheme whose estimate W is within ±e pe q to some value q with probability 
| + Q(l). Then, there is an approximation scheme B whose estimate W satisfies 



Pr [(1 - e pe )q < W < (1 + e pe )q] > 1 - 5 boost . 
It invokes the scheme B' as a subroutine O (log5^ st ) times. 



(40) 



With the help of Lemma [2j we now have the constituents required to compose the indi- 
vidual estimates of «j into an approximation for the partition function (J4]). 



Lemma 3. Let e > 0. Assume we have approximation schemes Ao,A\, 
their estimates Q , Qi, . . . , Qi-i satisfy 



, Ai-i such that 



Pr 



l-^)a,< Qi <(l + ±)a, 



Then, there is a simple approximation scheme A for the product a = ao^i • 
result Q = QoQi • • • Qe-i satisfies 



(41) 
The 



Pr [(l-e)a <Q< (1 + e)a] > - 



(42) 
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Proof. For each i = 0, ...,£ — 1, the failure probability for estimating «j is smaller than 
l/(4£). The union bound implies that the overall failure probability is smaller than 1/4, 
proving the lower bound | on the success probability in (T42"j) . 

To obtain the upper bound on the deviation, we now assume that each Qi takes the 
upper bound value. We have 

i=0 

e e/2 -l<e, 

where we have used l+x<e x <l + 2x, which is true for all x G [0, 1]. Thus, in the case of 
success, we have Q < (1 + e)a. 

To obtain the lower bound on the deviation, we assume that each Qi takes its lower 
bound value. We have 

i=0 i=0 

where we have used Uli 3 ^ — TiiVil — J2i \ x i ~ Vi\i ^ rue f° r arbitrary Xi, i/i G [0, 1]. Thus, in 
the case of success, we have (1 — e)a < Q. □ 

We are now ready to prove Theorem [2j 

Proof of Theorem^ For each i = 0, ...,£— 1, we can apply Lemma [1] with the state 
( |25|) and the projector P ( |26|) . This gives us a quantum approximation scheme for ctj. Note 
that to prepare \ipi), it suffices to prepare | tt^) once. Also, to realize a controlled reflection 
around it suffices to invoke the controlled reflection around |7Tj) once. 

We now use the reflection 2|-0j)(0j| — I and set e pe = e/(2f) in Lemma [U With these 
settings, we can apply Lemma [2] to the resulting approximation scheme for with 5b 00 st = 
l/(4£). This gives us approximation schemes Ai outputting Qi with high precision and 
probability of success that can be used in Lemma [3j The composite result Q — Q ■ ■ • Qe-i 
is thus an approximation for a = ato ■ ■ -ae-i with the property 

Pr [(l-e)a<Q< (l + e)a] > ^. (44) 

Finally, we obtain the estimate for Z by multiplying Q with Zq. Let us summarize the costs 
from Lemmas [T]l3j For each i = 0, ...,£— 1, this scheme uses log 5^ ost = 0(log£) copies of 
the state \7ii), and invokes (log f^^) e" 1 = O (^log£) reflections around |7Tj). □ 

3.3 Quantum FPRAS 

In the previous Section we have assumed that we can prepare the quantum samples |7Tj) 
and implement the controlled reflections Ri = 2|7Tj)(7Tj| — I about these states perfectly and 



Q — a 



< 



a 



< 
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efficiently. We now release these assumptions and show how to approximately accomplish 
both tasks with the help of quantum walks operators. We then show that the errors arising 
from these approximate procedures do not significantly decrease the success probability of 
the algorithm. This will wrap up the proof of our main result, Theorem [TJ 

In [10], two of us how to approximately prepare quantum samples |7r;) of stationary 
distributions of slowly-varying Markov chains. Using the fact that the consecutive states 
|7Tj) and |vr i+1 ) are close, we utilize Grover's | fixed-point search [J3] to drive the starting 
state | tt ) towards the desired state 1^) through multiple intermediate steps. Moreover, to 
be able to perform this kind of Grover search, we have to be able to apply selective phase 
shifts of the form Si = | tt^) (7Tj | + (I — |7r,;)(7Tj|) for tu = e ln ^ 3 and to = e~" r/ ' 3 . This is another 
assumption of Section [372] that we have to drop here. Nevertheless, an efficient way to apply 
these phase shifts approximately, based on quantum walks and phase estimation, exists [10J. 

Our task is to show that the approximation scheme from Lemma CD works even with 
approximate input states and using only approximate reflections about the states 1 7r^ ) . Let 
us start with addressing the approximate state preparation. To be able to use the results of 
[TO] , we first have to establish an important condition. For their method to be efficient, the 
overlap of two consecutive quantum samples |7Tj) and |7r i+1 ) has to be large. This is satisfied 
when ai = Z i+1 /Zi is bounded from below by |, since 



Ifaki+i)! 1 



> 



y/ e -/3iE(cr) e -0 i+1 E(a) 



\J Zi Z 



i+1 



E 



-f3i+lE{a) 



y/ZZi+i \J Zi + i 



The following lemma then directly follows from the arguments used in [TOl Theorem 2]. 

Lemma 4. For es > arbitrary and each i = 1, ...,£— 1, there is a quantum method 
preparing a state |7fj) with 



k>-k>|0Hl <e s 



(45) 



where a = O 



is the number of ancilla qubits. The method invokes a controlled version 



of a walk operator from the set {W(Pi) } . . . , W(Pe-i)} 



(46) 



times. 



We choose the preparation method from Lemma H] with 
precision es is 



^. The cost for this 



(47) 
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applications of the quantum walk operator. Recall that when we used Lemma [T] in Section 
13.21 with the state \ipi) (coming from the perfect quantum sample |7Tj)) as input, the success 
probability of the resulting scheme was greater than |. We now use the method given in 
Lemma U on the approximate input = Vi(\TVi) <8> |0)). With our chosen precision for 
preparing the success probability of the approximation scheme of Lemma [T] cannot 
decrease by more than 2 • 

The second assumption of Lemma [1] we need to drop is the ability to perfectly implement 
the reflections Ri = 2|7Ti) (71**1 — I. We now show how to approximately implement these 
reflections. The following lemma follows directly from the arguments in [TUJ Lemma 2 and 
Corollary 2]. 

Lemma 5. For > arbitrary and each i = 1, ...,£ — 1, there is an approximate reflection 
Ri such that 

® |o) 0fe ) = ® |0)® 6 + |0 (48) 

where \<p) is an arbitrary state, b = O ^loge^ 1 log ^= J is the number of ancilla qubits, and 

|£) is some error vector with < e R- It invokes the controlled version of a walk operator 

from{W(P 1 ),...,W(P t - 1 )} 

°O g ^) (49) 

times. 

Recall that in Lemma [U the controlled reflection i?, is invoked 0(1 /e pe ) times. We now 
run this approximation scheme with R^ instead of Ri. The norm of the accumulated error 
vector is 

O (—) ■ e R . (50) 

We choose 

e/? = ^(e pe ) (51) 

to bound the norm of the accumulated error from above by The success probability can 
then decrease by at most 2 • ^. 

Combining these arguments establishes a variant of Lemma [1] without the unnecessary 
assumptions of Section [3721 : 

Lemma 6. Let e pe G (0, 1). For each i = 0, ...,£— 1, there exists a quantum approximation 
scheme A" for a>i. Its estimate Q" satisfies 

Pr [(1 - e pe )« 4 < Q'l < (1 + e pe )a t ] > ^. (52) 
This scheme invokes the controlled version of a walk operator from {W(Pi), . . . , H^-i} 

Of 4= log 2 £+^t= log e- 1 ) . (53) 



pe 

\ / i i 

pe 
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Proof. The success probability of the scheme in Lemma [T] was greater than | . Both the ap- 
proximate state preparation and using approximate reflections reduce the overall probability 
of success by at most Thus the probability of success of the method given in Lemma [T] 



after dropping the unnecessary assumptions is at least f . □ 



We can finally complete the proof of Theorem [T] by following the procedure that led to 
the proof of Theorem [2] in Section 13.21 

Proof of Theorem^ For each i = 0, ...,£ — 1, we proceed as follows. We use the ap- 
proximation scheme A" from Lemma with precision e pe = e/(2£). We then boost the 
success probability of each A" to 1 — ■h by applying the powering lemma (Lemma [2]) with 
$boost — 1/(4^)- This step increases the cost in f l53|) by the factor 0(}og£). This resulting 
scheme Ai now satisfies the properties required for Lemma [31 We can thus use it to obtain 
a composite approximation scheme whose output satisfies 

Pi[(l-e)Z<Q<(l + e)Z] > ~ . (54) 

The resulting cost of this scheme (the number of times we have to invoke the controlled 
quantum walk operators) is 



O hg 3 £ + S=(log£) (\og£ + loge- 1 )^) = 6 (- 



V5 



(55) 
□ 



4 Conclusions 

We have shown how to quantumly speed up a classical FPRAS for approximating partition 
functions, as measured in the number of times we have to invoke a step of a quantum walk 
(instead of classical Markov chains). We obtained two reductions in complexity: 1/5 —> 1/ V5 
and 1/e 2 — > 1/e. These two reductions are intimately related; they cannot occur separately. 
If we used quantum samples merely to obtain classical samples (i.e., if we tried to estimate 
the ratios without phase estimation), then this would lead to 0(£ 3 ) dependence (for e oc t^ 1 ). 
This is because we would have to take 0(4) classical samples for each i and producing a 
quantum sample costs at least 0(£). The advantage of our approximation procedure based 
on quantum phase estimation is that it requires only one quantum sample (or more precisely, 
log£, after using the powering lemma to boost the success probability). We cannot obtain 
the second speed-up without using quantum samples (as mentioned in the introduction, this 
prevents us from using a procedure such as [H] that prepares density operators encoding 
stationary distributions). Also, the arguments employed in the error analysis in the quantum 
case are quite different from those in the classical error analysis. 

Each classical FPRAS we speed up uses the telescoping trick (jlj), a particular cooling 
schedule (decreasing sequence of temperatures), and slowly- varying Markov chains which 
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mix rapidly, with stationary distributions equal to the Boltzmann distributions at the inter- 
mediate temperatures. The classical FPRAS is useful only when we have the Markov chains 
with the required properties. Moreover, the cooling schedules need to be such that the ratios 
OLi Qj are lower bounded by some c™ 1 . In [17], the authors show that it is possible to use 
a cooling schedule T — oo > T[ > . . . > T'ft_ x = Tp for estimating the partition function 



Z(Tp) as long as for each i, 



E(V?) <6, (56) 



(E (>',))' 

where b is some constant. Such a cooling schedule is called a Chebyshev cooling schedule. 
Note that the above condition is automatically satisfied in the situation we consider in 
this paper, but not vice versa (recall that we assume that we have a cooling schedule such 
that E (Yi) is bounded from below by a constant for each i; we set it to ~ for simplicity 
of presentation). The advantage of Chebyshev cooling schedules is that they are provably 
shorter. The authors present an adaptive algorithm for constructing Chebyshev cooling 
schedule. We plan to explore if it is possible to speed up this process. But even if this 
is possible, a potential obstacle remains. It is not clear whether we can still obtain the 
reduction from -4 to - when we only know that the condition fl56|) is satisfied. It seems that 
the condition E (Yi) > c~ x with c some constant is absolutely necessary for phase estimation 
to yield the quadratic speed-up with respect to the accuracy parameter e. 

The combination of simulated annealing and the Monte Carlo Markov Chain method 
used in approximating partition functions is the central piece of the best currently known 
algorithm for estimating permanents with non-negative entries [9]. We therefore plan to 
explore where our techniques can be used to speed up this breakthrough classical algorithm. 
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A Quantum Walks from Classical Markov Chains 

The class of classical approximation schemes that we speed up uses reversible, ergodic Markov 
chains Pi with stationary distributions 7Tj. Here we briefly review the quantum analogue of 
a Markov chain, describing the quantum walk operator W corresponding to the classical 
Markov Chain P. 

In each step of a Markov chain P with state space Q, the probability of a state x to 
transition to another state y is given by the element p xy of the D x D transition matrix, 
where D = \Q\. Following Szegedy [3], for each such Markov Chain, we can define its 
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quantum analogue. The Hilbert space on which this quantum operation acts is C (g> C , 
with two C D registers. We start by defining the states 

\px) = J2Vp^\v) ■ ( 57 ) 

yea 

These states can be generated by a quantum update - any unitary U that satisfies 

U \x) \0) = \x) \ Px ) (58) 

for some fixed state G Q and all The quantum analogue of a Markov chain is then 

defined as follows. 

Definition 1 (Quantum Walk). A quantum walk W(P) based on a classical reversible 
Markov chain P is a unitary operation acting on the space C D (g> C D as 

W{P) = R B ■ Ra, (59) 

where R& and R^ are reflections about the spaces 

A = span{|a;) |0) : x G Q}, (60) 
B = U^SUA, (61) 

and S is a swap of the two registers. 

This particular definition of the quantum walk is suitable for making some of the proofs 
in [10] easier. It is equivalent to the standard definition of Szegedy [3J up to conjugation by 
U. Therefore, the spectral properties of our W and Szegedy's quantum walk are the same. 

Let 5 be the spectral gap of the classical Markov chain P. Let us write its eigenvalues as 
fi = 1 and fij = cos(9j), for j = 1, . . . , D — 1 and 9j G (0, |). According to Szegedy [3], on 
the space A + B, the eigenvalues of the quantum walk W(P) with nonzero imaginary part 
are e ± j . The phase gap of the quantum walk W(P) is then defined as A = 29 1 (with 9\ 
the smallest of 9j). When the Markov chain is ergodic and reversible, Szegedy proved that 

A > 2VS, (62) 

a quadratic relation between the phase gap A of the quantum walk W(P) and the spectral 
gap S of the classical Markov chain P. This quadratic relation is behind the speed-up of 
many of today's quantum walk algorithms. 
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